1-30. (Canceled)

31. (New) A system for identifying genes, comprising:

a pattern database comprising patterns of amino acids; 

an input device for inputting a genomic DNA sequence; and

a processor which:

translates an open reading frame (ORF) of said DNA sequence into an amino acid
translation;

locates in said amino acid translation occurrences of said patterns from said 

pattern database; and 

determines whether said open reading frame includes a putative gene based on a
number of said patterns of amino acids located in said amino acid translation of said ORF, and/or
weighted values associated with said patterns of amino acids located in said amino acid 
translation of said ORF. 
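
By way of non-limiting illustration, the translating and locating steps recited in claim 31 might be sketched in Python as follows. The function names `translate` and `locate` are hypothetical and do not appear in the claims. Representing each pattern as a regular expression in which "." matches any single residue is an assumption, as is the use of the standard genetic code.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(orf_dna):
    """Translate an ORF codon by codon, stopping at the first stop codon.

    Assumes the sequence contains only A, C, G and T; any other symbol
    raises KeyError in this sketch.
    """
    residues = []
    for i in range(0, len(orf_dna) - 2, 3):
        aa = CODON_TABLE[orf_dna[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def locate(patterns, translation):
    """Return {pattern: [start offsets]} for every pattern that occurs."""
    hits = {}
    for pattern in patterns:
        # A lookahead lets overlapping occurrences be reported as well.
        starts = [m.start() for m in re.finditer("(?=" + pattern + ")", translation)]
        if starts:
            hits[pattern] = starts
    return hits


translation = translate("ATGGCTAAATGGTGCTAA")
occurrences = locate(["A.W", "KW", "GG"], translation)
```

The number of keys in `occurrences` corresponds to the number of patterns located in the translation, which is one of the two quantities on which the determination in claim 31 may be based.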

32. (New) The system according to claim 31, wherein said processor [illegible] a plurality of
open reading frames in said DNA [illegible]
amino acid [illegible] occurrences of said patterns to determine which of said plurality of open
reading frames includes a putative gene.

33. (New) The system according to claim 32, wherein said patterns comprise biologically
significant patterns of amino acids in amino acid sequences.

34. (New) The system according to claim 31, wherein said processor identifies a match of a
pattern from said pattern database in said amino acid translation.

35. (New) The system according to claim 34, wherein said patterns are derived from a parent
database comprising at least one amino acid sequence.

36. (New) The system according to claim 34, wherein said patterns are derived from a parent 
database comprising at least one amino acid sequence fragment. 

37. (New) The system according to claim 34, wherein said patterns are derived by using a 
pattern discovery algorithm. 



38. (New) The system according to claim 34, wherein said patterns are derived by using the
Teiresias algorithm.



39. (New) The system according to claim 34, wherein said ORF comprises a portion of said 
DNA sequence between a start codon and a stop codon.
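
By way of non-limiting illustration, an ORF of the kind recited in claim 39 might be located as sketched below. The function name `find_orfs`, the choice of ATG as the start codon, the three stop codons of the standard genetic code, and the restriction to the three forward reading frames are all assumptions made for this sketch; the claims do not fix any of them.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, start_codon="ATG", min_codons=1):
    """Return (start, end) index pairs of ORFs on the forward strand.

    Each ORF runs from a start codon to the first in-frame stop codon.
    The returned slice dna[start:end] includes the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == start_codon:
                start = i  # first start codon opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None  # look for the next ORF in this frame
    return orfs


sequence = "CCATGAAATGGTAACC"
orfs = find_orfs(sequence)
```

A fuller treatment would also scan the reverse complement so that all six reading frames are covered; that step is omitted here for brevity.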

40. (New) The system according to claim 34, wherein said processor reports said ORF as a
putative gene when a predetermined number of pattern matches is identified in said amino acid
translation.

41. (New) The system according to claim 34, wherein each pattern is assigned a weight
depending upon a relevance of said pattern in determining whether said ORF comprises a
putative gene. 

42. (New) The system according to claim 34, wherein said ORF is reported as a putative
gene when a sum of weights corresponding to all patterns with matches in said amino acid
translation exceeds a predetermined threshold.

43. (New) The system according to claim 34, wherein said match is identified using a
predetermined pattern matching algorithm.
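
By way of non-limiting illustration, the two reporting criteria of claims 40 and 42 might be combined as sketched below. The function name `is_putative_gene` and its parameters are hypothetical, and the weight values are invented for the example; the claims do not specify how weights are chosen. The sketch treats "a predetermined number of pattern matches" as a minimum count and "exceeds a predetermined threshold" as a strict inequality, both of which are interpretive assumptions.

```python
def is_putative_gene(matched_patterns, weights, min_matches=None, weight_threshold=None):
    """Decide whether an ORF is reported as a putative gene.

    matched_patterns: patterns with at least one match in the translation.
    weights: mapping from pattern to its assigned weight (hypothetical values).
    min_matches: count criterion in the manner of claim 40, or None to skip.
    weight_threshold: weight-sum criterion in the manner of claim 42, or None.
    """
    matched = set(matched_patterns)  # each pattern contributes once
    if min_matches is not None and len(matched) >= min_matches:
        return True
    if weight_threshold is not None:
        total = sum(weights.get(p, 0.0) for p in matched)
        return total > weight_threshold
    return False


example_weights = {"A.W": 0.6, "KW": 0.3, "GG": 0.9}
by_count = is_putative_gene(["A.W", "KW"], example_weights, min_matches=2)
by_weight = is_putative_gene(["A.W", "KW"], example_weights, weight_threshold=0.8)
```

Either criterion may be disabled by leaving its parameter as `None`, which mirrors the "and/or" wording of the independent claims.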

44. (New) The system according to claim 34, further comprising: 

a memory device for storing data and instructions to be executed by said processor. 

45. (New) The system according to claim 34, further comprising:
a display device for displaying an output from said processor. 

46. (New) A method of identifying genes, comprising:
providing a pattern database comprising patterns of amino acids; 
determining an open reading frame (ORF) in a genomic DNA sequence; 
generating an amino acid translation for said ORF; 

locating a match of a pattern from said pattern database in said amino acid translation; 

determining whether said ORF includes a putative gene based on a number of said
matching patterns of amino acids located in said amino acid translation of said ORF, and/or a
weighted value associated with a matching pattern located in said amino acid translation of said
ORF; and

displaying a result of said determining whether said ORF includes a putative gene. 

47. (New) The method according to claim 46, wherein said pattern database is generated 
from a database comprising at least one amino acid sequence. 

48. (New) The method according to claim 46, wherein said pattern database is generated from
a database comprising at least one amino acid sequence fragment. 

49. (New) The method according to claim 46, further comprising:

identifying said ORF as a putative gene when a predetermined number of pattern
matches is identified in said amino acid translation.

50. (New) The method according to claim 46, further comprising:

assigning a weight to each pattern depending upon a relevance of said pattern in
determining whether said ORF includes a putative gene.

51. (New) The method according to claim 46, further comprising:
displaying said match of said pattern in said amino acid translation. 

52. (New) The method according to claim 46, wherein said pattern database is generated
using the Teiresias algorithm to derive said patterns from a parent database. 

53. (New) A programmable storage medium tangibly embodying a program of
machine-readable instructions executable by a digital processing apparatus to perform a method
for identifying genes, said method comprising:

providing a pattern database comprising patterns of amino acids;

determining an open reading frame (ORF) in a given genomic DNA sequence;

generating an amino acid translation for each ORF;

locating a match of a pattern from said pattern database in said amino acid
translation;

determining whether said ORF includes a putative gene based on a number of [illegible], and/or a
weighted value associated with a matching pattern located in said amino acid translation of
said ORF; and

displaying a result of said determining whether said ORF includes a putative
gene.

54. (New) The system according to claim 33, wherein said processor
determines for each pattern in said pattern database whether the pattern is present in said amino
acid translation by locating instances of said patterns in said amino acid translation, until a sum
of weights corresponding to all patterns with matches in said amino acid translation exceeds a
predetermined threshold, at which point said processor identifies said ORF as a putative gene.
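
By way of non-limiting illustration, the stopping behavior recited in claim 54 might be sketched as follows: patterns are tested one at a time and the scan ends as soon as the accumulated weight exceeds the threshold. The function name `scan_until_threshold`, the regular-expression form of the patterns, and the weight values are assumptions made for this sketch.

```python
import re


def scan_until_threshold(weighted_patterns, translation, threshold):
    """Test patterns in order, stopping once the weight sum exceeds threshold.

    weighted_patterns: iterable of (pattern, weight) pairs (hypothetical values).
    Returns (is_putative_gene, patterns matched before the scan stopped).
    """
    total = 0.0
    matched = []
    for pattern, weight in weighted_patterns:
        if re.search(pattern, translation):
            matched.append(pattern)
            total += weight
            if total > threshold:
                # Remaining patterns need not be examined.
                return True, matched
    return False, matched


database = [("A.W", 0.6), ("GG", 0.9), ("KW", 0.5), ("MA", 0.4)]
reported, matched = scan_until_threshold(database, "MAKWC", threshold=1.0)
```

In this example the scan stops after the third pattern, so the fourth is never tested; with a large pattern database, stopping early in this way avoids matching every pattern against every translation.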

55. (New) The system according to claim 31, further comprising:

a parent database comprising a plurality of amino acid sequences, said patterns in said
pattern database being derived from said plurality of amino acid sequences by using a pattern
discovery algorithm;

a memory device for storing data and instructions to be executed by said processor; and
a display device for displaying an output from said processor.



56. (New) The system according to claim 55, wherein said open reading frame (ORF)
comprises a portion of said DNA sequence between a start codon and a stop codon, 

wherein said processor identifies a match of a pattern from said pattern database in said
amino acid translation by using a predetermined pattern matching algorithm,

wherein each pattern is assigned a weight depending upon a relevance of said pattern in
determining whether said ORF comprises a putative gene, and

wherein said ORF is reported as a putative gene when either a predetermined number of
pattern matches is identified in said amino acid translation, or a sum of weights corresponding to
all patterns with matches in said amino acid translation exceeds a predetermined threshold.

57. (New) The system according to claim 31, wherein said processor accesses 
said pattern database to retrieve said patterns from said pattern database. 



58. (New) The system according to claim 31, wherein said processor is
electrically coupled to said input device and said pattern database.

59. (New) A system for identifying genes, comprising:

an input device which inputs a genomic DNA sequence; and 
a processor which: 

accesses a pattern database comprising a plurality of patterns of amino acids; 
translates an open reading frame (ORF) of said DNA sequence into an amino acid 

translation; 

locates in said amino acid translation occurrences of said patterns from said 

pattern database; and 

determines whether said open reading frame includes a putative gene based on a 

number of said patterns of amino acids located in said amino acid translation of said ORF, and/or 
weighted values associated with said patterns of amino acids located in said amino acid 
translation of said ORF. 

60. (New) A system for identifying genes, comprising: 

an input device which inputs a query genomic DNA sequence; 
a processor which: 

accesses a pattern database comprising a plurality of patterns of amino acids; 
translates an open reading frame (ORF) of said DNA sequence into an amino acid 

translation; and 

locates in said amino acid translation occurrences of said patterns from said 
pattern database; 

determines whether said open reading frame includes a putative gene based on a 
number of said patterns of amino acids located in said amino acid translation of said ORF, and/or 
weighted values associated with said patterns of amino acids located in said amino acid 

translation of said ORF; and 

a display device for displaying an output of said processor, said output including an 

occurrence of said patterns in said amino acid translation,

wherein said patterns comprise patterns derived using a Teiresias algorithm.
wherein said open reading frame (ORF) comprises a portion of said DNA sequence
between a start codon and a stop codon,

wherein said processor identifies a match of a pattern from said pattern database in said 
amino acid translation by using a predetermined pattern matching algorithm,

wherein each pattern is assigned a weight depending upon a relevance of said pattern in 
determining whether said ORF comprises a putative gene, and

wherein said ORF is reported as a putative gene when either a predetermined number of 
pattern matches is identified in said amino acid translation, or a sum of weights corresponding to 
all patterns with matches in said amino acid translation exceeds a predetermined threshold.